1 From text to hypertext by indexing

Airi Salminen, Jean Tague-Sutcliffe, Charles McClellan 



January 1995 ACM Transactions on Information Systems (TOIS), volume 13 issue 1
Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index terms, review



Full text available: 



A model is presented for converting a collection of documents to hypertext by means of 
indexing. The documents are assumed to be semistructured, i.e., their text is a hierarchy 
of parts, and some of the parts consist of natural language. The model is intended as a 
framework for specifying hypertextual reading capabilities for specific application areas
and for developing new automated tools for the conversion of semistructured text to 
hypertext. In the model, two well-known paradigms— ... 

Keywords: constrained grammars, grammars, hypertext, properties, structured text, text
types, text entities, transient hypergraphs 



2 Legally speaking
Rosalie Steier 

February 1989 Communications of the ACM, volume 32 issue 2 
Publisher: ACM Press 

Full text available: pdf(331.37 KB) Additional Information: full citation, index terms



3 Filtering and retrieval models: An empirical study on retrieval models for different
document genres: patents and newspaper articles
Makoto Iwayama, Atsushi Fujii, Noriko Kando, Yuzo Marukawa

July 2003 Proceedings of the 26th annual international ACM SIGIR conference on
Research and development in information retrieval SIGIR '03
Publisher: ACM Press 

Full text available: pdf(419.99 KB) Additional Information: full citation, abstract, references, citings, index terms

Reflecting the rapid growth in the utilization of large test collections for information 
retrieval since the 1990s, extensive comparative experiments have been performed to 
explore the effectiveness of various retrieval models. However, most collections were 
intended for retrieving newspaper articles and technical abstracts. In this paper, we
describe the process of producing a test collection for patent retrieval, the NTCIR-3 Patent 
Retrieval Collection, which includes two years of Japanese pat ... 

Keywords: patent retrieval, retrieval models, test collections 



4 Intellectual property aspects of web publishing
Holger M. Kienle, Daniel German, Scott Tilley, Hausi A. Muller 

October 2004 Proceedings of the 22nd annual international conference on Design of 
communication: The engineering of quality documentation SIGDOC '04 

Publisher: ACM Press 

Full text available: pdf(153.91 KB) Additional Information: full citation, abstract, references, citings, index terms

This paper addresses how intellectual property affects the Web in general, and content 
publishing on the Web in particular. Before its commercialization, the Web was perceived 
as being free and unregulated; this assumption is no longer true. Nowadays, content 
providers need to know which practices on the Web can result in potential legal problems. 
The vast majority of Web sites are developed by individuals such as technical writers or
graphic artists, and small organizations, which receive ...

Keywords: copyright, hypermedia, intellectual property, link law, open content, patents, 
trademarks, world wide web



5 A patent search and classification system 

Leah S. Larkey

August 1999 Proceedings of the fourth ACM conference on Digital libraries DL '99 

Publisher: ACM Press 

Full text available: pdf(164.37 KB) Additional Information: full citation, references, citings, index terms



Keywords: applications, classification, digital libraries, information retrieval, patents, 
systems, text categorization 



6 Evaluating document retrieval in patent database: a preliminary report
Mark Osborn, Tomek Strzalkowski, Mihnea Marinescu 

January 1997 Proceedings of the sixth international conference on Information and 
knowledge management CIKM '97 

Publisher: ACM Press 

Full text available: pdf(811.55 KB) Additional Information: full citation, references, citings, index terms



7 NSF workshop on industrial/academic cooperation in database systems 

Mike Carey, Len Seligman
March 1999 ACM SIGMOD Record, volume 28 issue 1
Publisher: ACM Press 

Full text available: pdf(1.96 MB) Additional Information: full citation, index terms
text databases into hierarchical topic taxonomies

Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan 

August 1998 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 7 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(281.37 KB) Additional Information: full citation, abstract, citings, index terms

We explore how to organize large text databases hierarchically by topic to aid better 
searching, browsing and filtering. Many corpora, such as internet directories, digital 
libraries, and patent databases are manually organized into topic hierarchies, also called 
taxonomies. Similar to indices for relational data, taxonomies make search and access 
more efficient. However, the exponential growth in the volume of on-line textual
information makes it nearly impossible to maintain such taxono ... 

9 A functional taxonomy for software watermarking 
Jasvir Nagra, Clark Thomborson, Christian Collberg 

January 2002 Australian Computer Science Communications, Proceedings of the
twenty-fifth Australasian conference on Computer science - Volume 4
ACSC '02, Volume 24 Issue 1

Publisher: Australian Computer Society, Inc., IEEE Computer Society Press 

Full text available: pdf(1.19 MB) Additional Information: full citation, abstract, references, citings, index terms

Despite the recent surge of interest in digital watermarking technology from the research 
community, we lack a comprehensive and precise terminology for software watermarking. 
In this paper, we attempt to fill that gap by giving distinctive names for the various 
protective functions served by software watermarks: Validation Mark, Licensing Mark,
Authorship Mark and Fingerprinting Mark. We identify the desirable properties and specific 
vulnerabilities of each type of watermark, and we illustrate ... 

Keywords: authentication, fingerprint, software authorship, software licensing, 
steganography, watermark



10 Frontmatter (TOC, Letters, Open Source Software (OSS) Patent Search Engine,
Calendar of Events, Workshop and Conference Information)
ACM SIGSOFT Software Engineering Notes staff

March 2005 ACM SIGSOFT Software Engineering Notes, volume 30 issue 2 

Publisher: ACM Press 

Full text available: pdf(564.92 KB) Additional Information: full citation, index terms



11 Digital signets: self-enforcing protection of digital information (preliminary version) 
Cynthia Dwork, Jeffrey Lotspiech, Moni Naor 

July 1996 Proceedings of the twenty-eighth annual ACM symposium on Theory of 
computing STOC '96 

Publisher: ACM Press

Full text available: pdf(1.24 MB) Additional Information: full citation, references, citings, index terms



12 Revisiting Document Length Hypotheses: A Comparative Study of Japanese
Newspaper and Patent Retrieval
Sumio Fujita

June 2005 ACM Transactions on Asian Language Information Processing (TALIP),
Volume 4 Issue 2
Publisher: ACM Press

Full text available: pdf(337.61 KB) Additional Information: full citation, index terms



13 Software watermarking: models and dynamic embeddings
Christian Collberg, Clark Thomborson 

January 1999 Proceedings of the 26th ACM SIGPLAN-SIGACT symposium on 
Principles of programming languages POPL '99 

Publisher: ACM Press 

Full text available: pdf(2.19 MB) Additional Information: full citation, references, citings, index terms



14 Linux Gazette: History of the Portable Network Graphics (PNG) Format 
Greg Roelofs 
April 1997 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 

Full text available: html(21.17 KB) Additional Information: full citation, references, citings, index terms



15 Can text analysis tell us something about technology progress? 
Khurshid Ahmad, AbdulMohsen Al-Thubaity 

July 2003 Proceedings of the ACL-2003 workshop on Patent corpus processing - 
Volume 20 

Publisher: Association for Computational Linguistics 

Full text available: pdf(113.52 KB) Additional Information: full citation, abstract, references

A corpus-based diachronic analysis of patent documents, based mainly on the 
morphologically productive use of certain terms, can help in tracking the evolution of key
developments in a rapidly evolving specialist field. The patent texts were obtained from
the US Patent & Trade Marks Office's on-line service and the terms were extracted 
automatically from the texts. The chosen specialist field was that of fast-switching devices 
and systems. The method presented draws from literature on biblio- a ...

16 A common architecture for different text processing techniques in an information 
retrieval environment 
G. Thurmair 

September 1986 Proceedings of the 9th annual international ACM SIGIR conference 
on Research and development in information retrieval SIGIR '86 

Publisher: ACM Press

Full text available: pdf(734.05 KB) Additional Information: full citation, abstract, references, citings

The following paper gives an overview on a text processing software called REALIST 
(Retrieval Aids by Linguistics and Statistics) which integrates different text processing 
techniques into a common surface. It supports the user by offering the environment of a 
given term, using morphological, syntactic and statistic means. The user can call up the
processing results, use it for indexing, classification or retrieval purposes and combine 
them as he wishes e.g. to set up a search logic. The te ... 

17 Extracting sentence segments for text summarization: a machine learning approach 
Wesley T. Chuang, Jihoon Yang

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '00

Publisher: ACM Press 

Full text available: pdf(945.26 KB) Additional Information: full citation, abstract, references, citings, index terms

With the proliferation of the Internet and the huge amount of data it transfers, text 
summarization is becoming more important. We present an approach to the design of an 
automatic text summarizer that generates a summary by extracting sentence segments. 
First, sentences are broken into segments by special cue markers. Each segment is 
represented by a set of predefined features (e.g. location of the segment, average term
frequencies of the words occurring in the segment, number of title words ... 

Keywords: machine learning, sentence segment extraction, text summarization 



18 Secure virtual private networks: the future of data communications 
Eli Herscovitz 

August 1999 International Journal of Network Management, volume 9 issue 4 
Publisher: John Wiley & Sons, Inc. 

Full text available: pdf(230.05 KB) Additional Information: full citation, abstract, index terms

The Internet is an almost ideal means for information retrieval and exchange. It is cost- 
effective, easy to use and easily accessible. However, it can also be susceptible to devious 
practices such as data tampering, eavesdropping and theft. This paper analyses secure
virtual private networks (VPNs) and their use in countering the problems of the
Internet. Copyright © 1999 John Wiley & Sons, Ltd. 

19 Making global digital libraries work: collection services, connectivity regions, and
collection views
Carl Lagoze, David Fielding, Sandra Payette 

May 1998 Proceedings of the third ACM conference on Digital libraries DL '98
Publisher: ACM Press

Full text available: pdf(1.25 MB) Additional Information: full citation, references, citings, index terms



20 Tamper-proofing software watermarks
Clark Thomborson, Jasvir Nagra, Ram Somaraju, Charles He 

January 2004 Proceedings of the second workshop on Australasian information 
security, Data Mining and Web Intelligence, and Software 
Internationalisation - Volume 32 ACSW Frontiers '04 

Publisher: Australian Computer Society, Inc. 

Full text available: pdf(249.93 KB) Additional Information: full citation, abstract, references, citings, index terms

We introduce a novel method called constant encoding, which can be used to tamper- 
proof a software watermark that is embedded in the dynamic data structures of a 
program. Our novel tamper-proofing method is based on transforming numeric or
non-numeric constant values in the text of the watermarked program into function calls whose
value depends on the watermark data structure. Under reasonable assumptions about the 
knowledge and resources of an attacker, we argue that no attacker can be c ... 